Detecting Concept Drift in Financial Time Series Prediction using Symbolic Machine Learning
نویسنده
چکیده
This paper investigates the use of strategies to enhance an existing machine learning tool, C4.5, to deal with concept drift and non-determinism in a time series domain. Temporal prediction is a difficult problem faced in most human endeavours. While many specialised time series prediction techniques have been developed, these techniques have limitations. Most are restricted to modeling whole series rather than extracting predictive features and are difficult for domain experts to understand. Symbolic machine learning promises to address these limitations. Symbolic machine learning has been very successful on a broad range of complex problems. To date, few attempts have been made to apply symbolic machine learning directly to temporal prediction. This has resulted in systems that cannot explicitly represent temporally ordered examples or handle changing target concepts. Financial prediction is a challenging target domain, which is temporally ordered, has target concepts that change over time and exhibits a high level of non-determinism. Financial markets are considered to be unpredictable by many academics and thus any improvement on chance is interesting. An aim of this study is to demonstrate that machine learning is capable of providing useful predictive strategies in financial prediction. For short term financial prediction a successful prediction rate of 60% is considered the minimum useful to domain experts. Our results imply that machine learning can exceed this target with the use of new techniques. By trading off coverage for accuracy, we were able to minimise the effect of both noise and concept drift. The work reported was in collaboration with and funded by Australian Gilt Securities Limited. Michael Harries was supported by an Australian Postgraduate Award (Industrial).
منابع مشابه
Machine learning algorithms for time series in financial markets
This research is related to the usefulness of different machine learning methods in forecasting time series on financial markets. The main issue in this field is that economic managers and scientific society are still longing for more accurate forecasting algorithms. Fulfilling this request leads to an increase in forecasting quality and, therefore, more profitability and efficiency. In this pa...
متن کاملDetecting Concept Drift in Data Stream Using Semi-Supervised Classification
Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...
متن کاملStock Price Prediction using Machine Learning and Swarm Intelligence
Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...
متن کاملConcept drift detection in business process logs using deep learning
Process mining provides a bridge between process modeling and analysis on the one hand and data mining on the other hand. Process mining aims at discovering, monitoring, and improving real processes by extracting knowledge from event logs. However, as most business processes change over time (e.g. the effects of new legislation, seasonal effects and etc.), traditional process mining techniques ...
متن کاملTime series forecasting of Bitcoin price based on ARIMA and machine learning approaches
Bitcoin as the current leader in cryptocurrencies is a new asset class receiving significant attention in the financial and investment community and presents an interesting time series prediction problem. In this paper, some forecasting models based on classical like ARIMA and machine learning approaches including Kriging, Artificial Neural Network (ANN), Bayesian method, Support Vector Machine...
متن کامل